37%
17.12.2014
performance from many perspectives (i.e., CPU, network, disk). The tool is called nmon
.
Nmon Overview
Nmon is short for “Nigel’s Monitor” and is a command-line tool that presents performance information ... Monitoring with Nmon
14%
19.11.2014
local or remote, in your browser using websocketd. Although I won't go into it in depth, Web-vmstat does a pretty good job monitoring problem servers. For example, if a node has been exhibiting strange
12%
27.08.2014
of I/O, but it is also useful to look further down the stack to see how the I/O requests appear at the various layers. One layer that is useful to monitor is the block layer, which is near the bottom
12%
30.07.2014
claims to “handle approximately 160,000 distinct metrics per minute running on two niagra-2 Sun servers on a very fast SAN” [1]. Graphite is thus best used in environments that need to monitor thousands
12%
02.07.2014
examining local I/O (if the nodes are doing local I/O)
checking whether any nodes are swapping
spot-monitoring the compute nodes
The real list of possible tasks is extensive, but anything you want
12%
18.06.2014
and more than 1PB of data? Moreover, the answers constantly change because users are adding, modifying, and deleting data, but understanding – or at the very least, monitoring – your filesystem holistically
12%
04.06.2014
(Network as a Service), Heat (Orchestration), and Ceilometer (monitoring).
The OpenStack dashboard. a.k.a. Horizon, does not create any data – either meta or user. The compute service Nova is a special case
14%
19.05.2014
with my /home/layton
directory on my local system (host = desktop
). I also access an HPC system that has its own /home/jlayton
directory (the login node is login1
). On the HPC system I only keep some
12%
06.05.2014
to the filesystem, which manages the resources and monitors the execution of the commands sent by a Hadoop-compatible application on the framework. These commands form jobs, and jobs are implemented as individual
88%
26.02.2014
In the continuing story of monitoring HPC systems, we look at code that measures process, network, and disk metrics.
...
In previous articles, I talked about cluster monitoring metrics and determining what you should monitor, then I looked at monitoring processor and memory metrics. In this article, I discuss three ... HPC, cluster management, monitoring, monitoring, statistics ...
In the continuing story of monitoring HPC systems, we look at code that measures process, network, and disk metrics.
... Monitoring HPC Systems: Process, Network, and Disk Metrics